Automatic recognition of continuous Cantonese speech with very large vocabulary

Authors

  • Alfred Ying Pang Ng
  • Lai-Wan Chan
  • Pak-Chung Ching
Abstract

This paper presents the first published results for automatic recognition of continuous Cantonese speech with a very large vocabulary. The vocabulary covered by this system is about the same size as that encountered in the Hong Kong local Chinese newspaper Wen Hui Bao (文匯報). The system covers 6335 Chinese characters (字), and a large number of Chinese words (詞) can be formed by combining these characters. The input to the system is the endpointed speech waveform of a sentence or phrase, and the output is the corresponding Big5-coded Chinese characters. In developing the recognition system, we have devised new methods for 1) construction of a continuous Cantonese speech database, 2) lexical tone recognition in continuous Cantonese speech, and 3) integration of lexical tone and base syllable recognition results. The speaker-dependent recognition rates for Chinese character, base syllable and lexical tone are 90.94%, 94.73% and 69.7% respectively.

Similar resources

Tone recognition of continuous Cantonese speech based on support vector machines

Tone is an essential component for word formation in all tone languages. It plays a very important role in the transmission of information in speech communication. In this paper, we look at using support vector machines (SVMs) for automatic tone recognition in continuously spoken Cantonese, which is well known for its complex tone system. An adaptive log-scale 5-level F0 normalization method is...

Use of Tone Information in Continuous Cantonese Speech Recognition

Cantonese, a syllabically paced, southern Chinese dialect, is also a tonal language where tones carry important lexical information. It is rich in tonal variations and each syllable can have up to 9 different tone patterns. In this paper we investigate how to incorporate the tone information into a large vocabulary continuous speech recognition system. A two-pass, post-processing scheme is prop...

Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting

Islamic Republic of Iran Broadcasting (IRIB), as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, the IRIB's archive is one of the richest archives in Iran, containing a huge amount of multimedia data. Monitoring this massive volume of data, and browsing and retrieval of this archive, is one of the key issues for this broadcasting...

Automatic Recognition of Cantonese-English Code-Mixing Speech

Code-mixing is a common phenomenon in bilingual societies. It refers to the intra-sentential switching of two different languages in a spoken utterance. This paper presents the first study on automatic recognition of Cantonese-English code-mixing speech, which is common in Hong Kong. This study starts with the design and compilation of code-mixing speech and text corpora. The problems of acoust...

Tone information as a confidence measure for improving Cantonese LVCSR

Cantonese, a syllabically paced, southern Chinese dialect, is also a tonal language. A Cantonese syllable can have up to 9 different tone patterns, which are lexically important. In this paper, after reviewing major approaches to incorporating tone information into a large vocabulary continuous speech recognition (LVCSR) system, we propose two schemes to employ the tone information as a confidenc...

Publication date: 1997